AITopics | instance-specific prompt

Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation

Neural Information Processing Systems | Mar-22-2026, 09:00:50 GMT

Promptable segmentation typically requires instance-specific manual prompts to guide the segmentation of each desired object. To minimize such a need, task-generic promptable segmentation has been introduced, which employs a single task-generic prompt to segment various images of different objects in the same task. Current methods use Multimodal Large Language Models (MLLMs) to reason detailed instance-specific prompts from a task-generic prompt for improving segmentation accuracy. The effectiveness of this segmentation heavily depends on the precision of these derived prompts. However, MLLMs often suffer hallucinations during reasoning, resulting in inaccurate prompting. While existing methods focus on eliminating hallucinations to improve a model, we argue that MLLM hallucinations can reveal valuable contextual insights when leveraged correctly, as they represent pre-trained large-scale knowledge beyond individual images.

artificial intelligence, hallucination, natural language, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language (0.58)

c1e1ad233411e25b54bb5df3a0576c2c-Paper-Conference.pdf

Neural Information Processing Systems | Mar-14-2026, 04:27:23 GMT

hallucination, instance-specific prompt, segmentation, (15 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Asia > China > Shanghai > Shanghai (0.04)
Europe > Poland (0.04)
Asia > South Korea > Daejeon > Daejeon (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning Supplementary Materials

Neural Information Processing Systems | Feb-11-2026, 21:31:14 GMT

We observe that the learned prompts have no intuitive semantics.

artificial intelligence, dataset, learning a condensed frame for memory-efficient video class-incremental learning supplementary material, (11 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.49)

Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning

Neural Information Processing Systems | Feb-11-2026, 21:31:11 GMT

Recent incremental learning for action recognition usually stores representative videos to mitigate catastrophic forgetting. However, only a few bulky videos can be stored due to the limited memory.

artificial intelligence, arxiv preprint arxiv, machine learning, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation

Neural Information Processing Systems | Oct-10-2025, 15:39:48 GMT

However, MLLMs often suffer hallucinations during reasoning, resulting in inaccurate prompting.

hallucination, instance-specific prompt, segmentation, (15 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Asia > China > Shanghai > Shanghai (0.04)
Europe > Poland (0.04)
Asia > South Korea > Daejeon > Daejeon (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

c8ac22c0d4b263618f2a4f4657948912-Supplemental-Conference.pdf

Neural Information Processing Systems | Aug-18-2025, 21:15:58 GMT

artificial intelligence, condensed frame, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > China > Shaanxi Province > Xi'an (0.04)
Asia > China > Hong Kong (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning

Neural Information Processing Systems | Aug-18-2025, 21:15:54 GMT

Recent incremental learning for action recognition usually stores representative videos to mitigate catastrophic forgetting. However, only a few bulky videos can be stored due to the limited memory.

artificial intelligence, machine learning, proceedings, (13 more...)

Neural Information Processing Systems

Country:

Asia > China > Shaanxi Province > Xi'an (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation

Neural Information Processing Systems | May-27-2025, 15:23:09 GMT

Promptable segmentation typically requires instance-specific manual prompts to guide the segmentation of each desired object. To minimize such a need, task-generic promptable segmentation has been introduced, which employs a single task-generic prompt to segment various images of different objects in the same task. Current methods use Multimodal Large Language Models (MLLMs) to reason detailed instance-specific prompts from a task-generic prompt for improving segmentation accuracy. The effectiveness of this segmentation heavily depends on the precision of these derived prompts. However, MLLMs often suffer hallucinations during reasoning, resulting in inaccurate prompting. While existing methods focus on eliminating hallucinations to improve a model, we argue that MLLM hallucinations can reveal valuable contextual insights when leveraged correctly, as they represent pre-trained large-scale knowledge beyond individual images.

hallucination, leveraging hallucination, reduce manual prompt dependency, (10 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language (0.60)

Enhancing Visible-Infrared Person Re-identification with Modality- and Instance-aware Visual Prompt Learning

Wu, Ruiqi, Jiao, Bingliang, Wang, Wenxuan, Liu, Meng, Wang, Peng

arXiv.org Artificial Intelligence | Jun-18-2024

The Visible-Infrared Person Re-identification (VI ReID) aims to match visible and infrared images of the same pedestrians across non-overlapped camera views. These two input modalities contain both invariant information, such as shape, and modality-specific details, such as color. An ideal model should utilize valuable information from both modalities during training for enhanced representational capability. However, the gap caused by modality-specific information poses substantial challenges for the VI ReID model to handle distinct modality inputs simultaneously. To address this, we introduce the Modality-aware and Instance-aware Visual Prompts (MIP) network in our work, designed to effectively utilize both invariant and specific information for identification. Specifically, our MIP model is built on the transformer architecture. In this model, we have designed a series of modality-specific prompts, which could enable our model to adapt to and make use of the specific information inherent in different modality inputs, thereby reducing the interference caused by the modality gap and achieving better identification. Besides, we also employ each pedestrian feature to construct a group of instance-specific prompts. These customized prompts are responsible for guiding our model to adapt to each pedestrian instance dynamically, thereby capturing identity-level discriminative clues for identification. Through extensive experiments on SYSU-MM01 and RegDB datasets, the effectiveness of both our designed modules is evaluated. Additionally, our proposed MIP performs better than most state-of-the-art methods.

information, module, re-identification, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3652583.3658109

2406.12316

Country:

Asia > Thailand > Phuket > Phuket (0.05)
Asia > China > Shaanxi Province > Xi'an (0.05)
Asia > China > Zhejiang Province > Ningbo (0.05)
(2 more...)

Genre: Research Report > Promising Solution (0.48)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Prompt Customization for Continual Learning

Dai, Yong, Hong, Xiaopeng, Wang, Yabin, Ma, Zhiheng, Jiang, Dongmei, Wang, Yaowei

arXiv.org Artificial Intelligence | Apr-27-2024

Contemporary continual learning approaches typically select prompts from a pool, which function as supplementary inputs to a pre-trained model. However, this strategy is hindered by the inherent noise of its selection approach when handling increasing tasks. In response to these challenges, we reformulate the prompting approach for continual learning and propose the prompt customization (PC) method. PC mainly comprises a prompt generation module (PGM) and a prompt modulation module (PMM). In contrast to conventional methods that employ hard prompt selection, PGM assigns different coefficients to prompts from a fixed-sized pool of prompts and generates tailored prompts. Moreover, PMM further modulates the prompts by adaptively assigning weights according to the correlations between input data and corresponding prompts. We evaluate our method on four benchmark datasets for three diverse settings, including the class, domain, and task-agnostic incremental learning tasks. Experimental results demonstrate consistent improvement (by up to 16.2%), yielded by the proposed method, over the state-of-the-art (SOTA) techniques.
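
The contrast between hard prompt selection and coefficient-weighted prompts can be illustrated with a small sketch. This is not the paper's PGM: the function names, the dot-product scoring, and the softmax normalization are all assumptions chosen only to show the general idea of blending every prompt in a fixed-size pool instead of picking one.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def customize_prompt(query, keys, pool):
    """Blend all prompts in a fixed-size pool (hypothetical helper).

    Assumption: one coefficient per prompt, obtained by a softmax over
    query-key dot products. The source abstract does not specify how
    the coefficients are computed.
    """
    coeffs = softmax([dot(query, key) for key in keys])
    dim = len(pool[0])
    blended = [sum(c * prompt[i] for c, prompt in zip(coeffs, pool))
               for i in range(dim)]
    return coeffs, blended


# Toy data: three prompts of length 3, each paired with a 2-d key.
query = [1.0, 0.0]
keys = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
pool = [[1.0, 1.0, 1.0], [2.0, 2.0, 2.0], [3.0, 3.0, 3.0]]

coeffs, prompt = customize_prompt(query, keys, pool)
```

A hard-selection baseline would return only the single prompt whose key scores highest; here every prompt contributes in proportion to its coefficient, so the blended prompt varies smoothly with the query.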

codebook, continual learning, learning, (12 more...)

arXiv.org Artificial Intelligence

2404.1806

Country:

Oceania > Australia > Victoria > Melbourne (0.05)
Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
Asia > China > Shaanxi Province > Xi'an (0.04)
(2 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Health & Medicine (0.68)
Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
